CTGCAGTCTATTGGATGAAGAGTGTACATATTCATATAATTCTTAAAGTAGGCAGAAATTAAAG 
GGGATGGAAATATATACTTGTACTGCCTTAGATAGTCACCAGGATGTTGTTACAGTCTTCGTTT 
ACTGCTTCTGAAGCCTATACTGATAGAATTAATAAAATACTGAGAGAGAGAGAGAGGGACAGAG 
AGAGAGAGGGGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAGA 
GAAGAAAACAAGGTSAAGCCATCTGCTTAACTTATGTCCACATTCTCTCAAGAGCATTGTCCTA
TTTGTAGAATTATCTATATTGTTAAGAATCATCTCCATTGTTAAGATTTTGTGGGCTGGAGATC 
CAGCTCTGTTGATAAAGTGCTTGCCTAACATGCATGAAGTCCTAGGTTCTATTCCCAAGGCTAC 
ATAAAACCTTGTGTTGTGATGAATGCCTGTAATCCCAGTACGCAGCAAGGAGAGACAAGGAGGA
TCAGAAGCTTAAGGACATCATTTTGTACATAGTGAGTTTGAGGAAAGCTGAGGTTACATGGAAC 
TCTCTCTCTCTCAAAAACAAAACAAAACAAAACAAAACCTTCTACTAATATTCTGGATTCTGTT 
TGATTTTTAGGATCTCAAGAGCATGCTGACGTCATTTATGTGTTTCCATCAGATACAGACAGAG 
ATCATAAACATTTAACTCATTGATTATATGTTGAGAGTTGTCCCTCAAGAACCAATGGCCAAAC 
ATCCACTGAGGATACACGGAAGCTTAGAAAATCTCTAATTAAAATCCTGACATAATGGAAGTGC 
TCACAAACCAGCCAACACCTAATAAAACCAGTGGCAAGAGCAACAACTCGGCATTTTTCTACTT
TGAATCCTGCCAACCCCCTTTTCTAGCCATACTCTTGCTACTCATAGCATATACTGTGATCCTA 
ATCATGGGCATTTTTGGAAACCTCTCTCTTATCATCATCATCTTTAAGAAACAGAGAGAAGCTC 
AAAATGTTACCAACATACTGATTGCCAACCTGTCCCTCTCTGACATCTTGGTGTGTGTCATGTG
CATCCCTTTTACGGTCATCTACACTCTGATGGACCACTGGGTATTTGGGAACACTATGTGTAAA 
CTCACTTCCTACGTGCAAAGTGTCTCAGTTTCTGTGTCCATATTCTCCCTTGTGTTGATTGCTA 
TTGAACGATATCAGCTGATTGTGAACCCCCGTGGCTGGAAACCCAGAGTAGCTCATGCCTATTG 
GGGGATCATCTTGATTTGGCTCATTTCTCTGACATTGTCTATTCCCTTATTCCTGTCCTACCAC 
CTCACCAATGAGCCCTTTCATAATCTCTCTCTCCCTACTGACATCTACACCCACCAGGTAGCTT 
GTGTGGAGATTTGGCCTTCTAAACTGAACCAACTCCTCTTTTCTACATCATTATTTATGCTCCA 
GTATTTTGTCCCTCTGGGTTTCATTCTTATCTGCTACCTGAAGATCGTTCTCTGCCTCCGAAAA 
AGAACTAGGCAGGTGGACAGGAGAAAGGAAAATAAGAGCCGTCTCAATGAGAACAAGAGGGTAA 
ATGTGATGTTGATTTCCATCGTAGTGACTTTTGGAGCCTGCTGGTTGCCCTTGAACATTTTCAA 
TGTCATCTTCGACTGGTATCATGAGATGCTGATGAGCTGCCACCACGACCTGGTATTTGTAGTT 
TGCCACTTGATTGCTATGGTTTCTACTTGCATAAATCCTCTCTTTTATGGATTTCTCAACAAAA 
ACTTCCAGAAGGATCTAATGATGCTTATTCACCACTGTTGGTGTGGTGAACCTCAGGAAAGTTA 
TGAAAATATTGCCATGTCTACTATGCACACAGATGAATCCAAGGGATCATTAAAACTGGCTCAC 
ATACCAACAGGCATATAGAAACTGGTAAGCAAAATCAAAGCCCTTCTGTTATGAAAGAAAGAGA 
AGAAATAGTATGGAATAGGGCAAGGTGCAGAGGAAGCCAGACTTAAACACATAATATCTTTGGG 
CCCAGTTTTGCTTTAAGTTAAGCATGTCTACTCCATTCAGCCATAGAACACACAGAGATTTATC 
CCTACCCTTTCTTTTTTTCCTTTGGAAGAATAATAACTTAAACAACCTAGACATCATTACTGAG 
GAAGAGAACAAAAATGAGAGAGCATACAAGGACAGCAGAGATGTCTGGGGTACAAAATTCACGT 
TATTCGCTGGAATAGCTAGAAAGTTATTAGTTGTGCTGCAG (SEQ ID NO:1)



FIGURE 1 



underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



[CTGCAGTCTATTGGATGAAGAGTGTACATATTCATATAATTCTTAAAGTAGGCAGAAAT 
TAAAGGGGATGGAAATATATACTTGTACTGCCTTAGATAGTCACCAGGATGTTGTTACAG 
TCTTCGTTTACTGCTTCTGAAGCCTATACTGATAGAATTAATAAAATACTGAGAGAGAGA
GAGAGGGACAGAGAGAGAGAGGGGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 
GAGAGAGAGAGAGAAGAGAAGAAAACAAGGTSAAGCCATCTGCTTAACTTATGTCCACAT 
TCTCTCAAGAGCATTGTCCTATTTGTAGAATTATCTATATTGTTAAGAATCATCTCCATT 
GTTAAGATTTTGTGGGCTGGAGATCCAGCTCTGTTGATAAAGTGCTTGCCTAACATGCAT 
GAAGTCCTAGGTTCTATTCCCAAGGCTACATAAAACCTTGTGTTGTGATGAATGCCTGTA 
ATCCCAGTACGCAGCAAGGAGAGACAAGGAGGATCAGAAGCTTAAGGACATCATTTTGTA
CATAGTGAGTTTGAGGAAAGCTGAGGTTACATGGAACTCTCTCTCTCTCAAAAACAAAAC 
AAAACAAAACAAAACCTTCTACTAATATTCTGGATTCTGTTTGATTTTTAGGATCTCAAG 
AGCATGCTGACGTCATTTATGTGTTTCCATCAGATACAGACAGAGATCATAAACATTTAA 
CTCATTGATTATATGTTGAGAGTTGTCCCTCAAGAACCAATGGCCAAACATCCACTGAGG 
ATACACGGAAGCTTAGAAAATCTCTAATTAAAATCCTGACATAATGGAAGTGCTCACAAA
CCAGCC ] AACACCTAATAAAACCAGTGGCAAGAGCAACAACTCGGCATTTTTCTACTTTG 
AATCCTGCCAACCCCCTTTTCTAGCCATACTCTTGCTACTCATAGCATATACTGTGATCC 
TAATCATGGGCATTTTTGGAAACCTCTCTCTTATCATCATCATCTTTAAGAAACAGAGAG 
AAGCTCAAAATGTTACCAACATACTGATT [GCCAACCTGTCCCTCTCTGACATCTTGGTG 
TGTGTCATGTGCATCCCTTTTACGGTCATCTACACTCTGATGGACCACTGGGTATTTGGG 
AACACTATGTGTAAACTCACTTCCTACGTGCAAAGTGTCTCAGTTTCTGTGTCCATATTC 
TCCCTTGTGTTGATTGCTATTGAACGATATCAGCTGATTGTGAACCCCCGTGGCTGGAAA 
CCCAGAGTAGCTCATGCCTATTGGGGGATCATCTTGATTTGGCTCATTTCTCTGACATTG 
TCTATTCCCTTATTCCTGTCCTACCACCTCACCAATGAGCCCTTTCATAATCTCTCTCTC 
CCTACTGACATCTACACCCACCAGGTAGCTTGTGTGGAGATTTGGCCTTCTAAACTGAAC
CAACTCCTCTTTTCTACATCATTATTTATGCTCCAGTATTTTGTCCCTCTGGGTTTCATT 
CTTATCTGCTACCTGAAGATCGTTCTCTGCCTCCGAAAAAGAACTAGGCAGGTGGACAGG 
AGAAAGGAAAATAAGAGCCGTCTCAATGAGAACAAGAGGGTAAATGTGATGTTGATTTCC
ATCGTAGTGACTTTTGGAGCCTGCTGGTTGCCCTTGAACATTTTCAATGTCATCTTCGAC 
TGGTATCATGAGATGCTGATGAGCTGCCACCACGACCTGGTATTTGTAGTTTGCCACTTG 
ATTGCTATGGTTTCTACTTGCATAAATCCTCTCTTTTATGGATTTCTCAACAAAAACTTC 
CAGAAG] GATCTAATGATGCTTATTCACCACTGTTGGTGTGGTGAACCTCAGGAAAGTTA
TGAAAATATTGCCATGTCTACTATGCACACAGATGAATCCAAGGGATCATTAAAACTGGC 
TCACATACCAACAGGCATATAGAAACTGGTAAGCAAAATCAAAGCCCTTCTGTTATGAAA 
GAAAGAGAAGAAATAGTATGGAATAGGGCAAGGTGCAGAGGAAGCCAGACTTAAACACAT 
AATATCTTTGGGCCCAGTTTTGCTTTAAGTTAAGCATGTCTACTCCATTCAGCCATAGAA 
CACACAGAGATTTATCCCTACCCTTTCTTTTTTTCCTTTGGAAGAATAATAACTTAAACA 
ACCTAGACATCATTACTGAGGAAGAGAACAAAAATGAGAGAGCATACAAGGACAGCAGAG 
ATGTCTGGGGTACAAAATTCACGTTATTCGCTGGAATAGCTAGAAAGTTATTAGTTGTGC 
TGCAG 
Gene Sequence Structure
* 



846 bp 



Sequence Deleted 



1047 bp 



Size of CDS: 2281 bp 




Targeting Vector* (genomic sequence) 
Construct Number: 2762 



Arm Length: 
5': 5 kb 
3': 0.5 kb 



* Not drawn to scale



LacZ-Neo 

Cassette 



3' arm




5'>TTTAGGATCTCAAGAGCATGC
TGACGTCATTTATGTGTTTCCATC
AGATACTGACAGAGATCATAAACA
TTTAACTCATTGATTATATGTTGA
GAGTTGTCCCTCAAGAACCAATGG
CCAAACATCCACTGAGGATACACG
GAAGCTTAGAAAATCTCTAATTAA
AATCCTGACATAATGGAAGTGCTC
ACAAACCAGCC<3'
(SEQ ID NO: 2)



5'>GCCAACCTGTCCCTCTCTGAC
ATCTTGGTGTGTGTCATGTGCATC
CCTTTTACGGTCATCTACACTCTG
ATGGACCACTGGGTATTTGGGAAC
ACTATGTGTAAACTCACTTCCTAC
GTGCAAAGTGTCTCAGTTTCTGTG
TCCATATTCTCCCTTGTGTTGATT
GCTATTGAACGATATCAGCTGATT
GTGAACCCCCG<3'
(SEQ ID NO: 3)